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TITLE : MUTANT PROTEINASE WITH REDUCED SELF-CLEAVAGE 
ACTIVITY AND METHOD OF PURIFICATION 

INVENTORS: Jennifer A, Doudna, Louise J. Lucast, and Robert T. Batey 

5 

FIELD OF THE INVENTION 

The present invention is related to compositions comprising mutant proteinase 
having reduced self-cleavage activity relative to the self-cleavage activity of its wild-type 
proteinase. The mutant proteinase has the same substrate cleavage-activity as its wild- 
10 type proteinase and it is more stable than its wild-type proteinase. The present invention 
is also related to methods of obtaining large quantities of purified active proteins that 
form inclusion bodies in cells. 

BACKGROUND 

15 Proteinases are present in both prokaryotic and eukaryotic systems and have been 

shown to play an important role in the processing of large precursor polyproteins during 
viral replication. The reliance of viruses on proteolytic processing by virally encoded 
proteases has been suggested to have several evolutionary advantages, such as a need for 
reduced genomic content and partial release from the constraints of the mechanisms of 

20 transcriptional and translational regulation of the host cell (Lawson et ah, 1990). Viruses 
have evolved methods for regulating the proteolytic cascade that produces viral structural 
and replication proteins to replace the host mechanisms. Lawson et ah (1991) report that 
in picornaviruses, the temporal and spatial distribution of expressed protease activity 
affects the appearance and location of the final proteolytic product. 

25 Potyviruses are members of the picornaviral superfamily that infect plants. Like 

other members of the picornaviral family, potyviruses make extensive use of proteinases 
during replication. An example of a potyvirus that possesses a genome encoding a single 
large polyprotein proteolytically processed by virally encoded protease is the tobacco etch 
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virus (TEV). TEV has a single-strand, plus-sense RNA genome of about 9,500 
nucleotides. The RNA is organized as a single open reading frame and encodes a 346 
kDa polyprotein (Allison et ah 9 1986). The polyprotein is co- and post-translationally 
processed by viral encoded proteinases. Two proteinases, PI and helper component 
5 proteinase (HC-Pro), are responsible for their autocatalytic release from the amino- 
terminus of the polyprotein (Verchot et ah 1991; Carrington et tf/,,1989). The third 
proteinase, nuclear inclusion proteinase (NIa), mediates all other cleavage events. 

Characterization of the 27 kDa NIa Proteinase of Potyvirus 

10 The NIa proteinase is 49 kDa and is found as an aggregate with the 54 kDa Nib 

polypeptide in nuclear inclusion bodies in infected plant cells (Carrington et ah, 1988; 
Parks et ah, 1995). The 49 kDa NIa proteinase is a picornavirus 3C-like proteinase that 
recognizes cleavage sites within the C-terminal two-thirds of the polyprotein. The 
proteolytic domain of NIa lies within the C-terminal half of the protein and has a 

1 5 molecular weight of about 27 kDa, while the N-terminal region of NIa contains the Vpg 
(viral protein, genome-linked) activity and has a molecular weight of about 21 kDa. 

Structurally, the 27 kDa NIa proteinase has been reported to be similar to the 
trypsin-like family of cellular serine proteinases, such as chymotrypsin or trypsin, with 
the substitution of Cys for serine as the active site nucleophile (Blazan et ah, 1990; 

20 Dougherty et ah 9 1989). Dougherty et ah (1989) disclose that the catalytic triad of 27 
kDa proteinase is composed of His, Asp, and Cys, being similar to the catalytic triad 
found in other viral proteinases (Dougherty et ah, 1989). However, unlike the other 
proteinases, the 27 kDa proteinase recognizes an extended heptapeptide sequence, E-X-X- 
Y-X-Ql S/G (positions P6-P1 1F1; X is any amino acid) (SEQ ID NO: 1), and cleaves 

25 within the heptapeptide sequence (Dougherty et ah, 1989a; Dougherty et ah 9 1988; 

Dougherty et ah, 1989b). Residues at positions P6, P3, PI, and P'l are conserved and 
essential for optimal cleavage. Amino acids at the other positions appear to modulate the 
rate at which cleavage occurs (Dougherty et ah, 1989; Dougherty and Parks, 1989). 
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Moreover, the 27 kDa NIa proteinase appears to be structurally and functionally 
similar to other plus-stranded RNA viral-encoded proteinases (Krausslich and Wimmer, 
1998). First of all, it cleaves the polyprotein between particular Gln-Gly or Glyn-Ser 
dipeptides. Secondly, proteolytic activity is enhanced by dithiothreitol. Thirdly, the gene 
5 encoding this proteinase is adjacent to the putative RNA-dependent, RNA-polymerase 
gene. Lastly, the proteinase contains a conserved C-terminal amino acid motif (Cys — 15 
amino acids—His) (Argos et ah, 1984). This last characteristic is shared by proteinases 
encoded by many RNA viruses that translationally express their genetic information as a 
single polyprotein from genome length RNA (Dougherty et al, 1989). 

10 Additionally, Parks et al (1995) report that the 27 kDa NIa proteinase contains an 

internal self-cleavage site positioned at 24 amino acids from the carboxyl terminus of the 
proteinase and that the active 27 kDa proteinase converts to a lower molecular weight 
form with time. The 27 kDa NIa proteinase lacking the C-terminal 24 amino acids 
exhibits limited activity. The truncated proteinase is about one-twentieth as efficient in 

15 proteolysis of a test peptide substrate as the Ml length form, and Parks et al (1995) 
indicate that the 27 kDa NIa proteinase appears to lose its activity with time. 

Further, Polayes et al. (1994) disclose that the 27 kDa NIa proteinase is a highly 
specific protease that is active under a broad temperature range and on a variety of 
substrates. Polayes et al report rapid cleavage at 30°C and 37 °C, about 80% cleavage at 

20 both 21 °C and 16°C in one hour, and 50% cleavage at 4°C. Accordingly, Polayes et al 
recommend the use of this proteinase as a tool for removing affinity tags from fusion 
proteins. 

Use of 27 kDa NIa Proteinase Cleavage System for Purification of Proteins 
25 Parks et al (1994) disclose an improved method for the production, cleavage, and 

purification of fusion proteins and peptides using the 27 kDa NIa proteinase. The method 
comprises producing a fusion protein comprising the protein of interest, a carrier peptide 
(such as an affinity carrier) and a 27 kDa NIa proteinase cleavage site inserted between 
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the two, purifying the fusion protein, and incubating the fusion protein with the 27 kDa 
NIa proteinase to remove the carrier peptide from the protein of interest. 

Johnston et ah, U.S. Patent 5,532,142, disclose a similar method of isolation and 
purification of recombinant proteins using the 27 kDa NIa proteinase. Like the 
5 purification method of Parks et ah{\ 994), the method of Johnston et ah involves 

producing large quantities of the fusion protein containing a desired protein fused to the 
27 kDa NIa proteinase cleavage site which is the carrier peptide, purifying the fusion 
protein, and incubating the purified fusion protein with the 27 kDa NIa proteinase to 
remove the carrier peptide from the desired protein. 

10 Unlike other proteinases, the 27 kDa proteinase exhibits high specificity, 

insensitivity to many proteinase inhibitors used in protein purification, and efficient 
cleavage under a broad range of temperatures (Polayes et ah, 1994). Moreover, the 
protein of interest is easily separated from the carrier peptide and the 27 kDa proteinase. 
For these reasons, there is an on-going interest in obtaining large quantities of the active 

1 5 protein for use as a tool in protein purification. 

Purification of Proteins Including the 27 kDa NIa Proteinase from Inclusion Bodies 

Purification of Proteins that Form Inclusion Bodies. The development of 
recombinant DNA technology has enabled the cloning and expression of proteins in 

20 bacteria, yeast and mammalian cells and has made it possible to produce therapeutics and 
industrially important proteins at economically feasible levels. However, the expression 
of high levels of recombinant proteins in Escherichia coli often results in the formation of 
inactive, denatured protein that accumulates in intracellular aggregates known as 
insoluble inclusion bodies (Krueger et ah 9 "Inclusion bodies from proteins produced at 

25 high levels in Escherichia coli," in Protein Folding, L. M. Gierasch and P. King (Eds), 
Am. Ass. Adv. Sci., 136-142 (1990); Marston, Biochem. J. 240:1-12 (1986); Mitraki et 
ah, Bio/Technol. 7: 800-807 (1989); Schein, Bio/Technol. 7:1 141-1 147 (1989); Taylor et 
ah, Bio/Technol. 4: 553-557 (1986)). Inclusion bodies are dense aggregates, which are 
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2-3 m in diameter and largely composed of recombinant protein, that can be separated 
from soluble bacterial proteins by low-speed centrifugation after cell lysis (Schoner et al, 
Biotechnology 3:151-154 (1985)). 

The recovery of recombinantly expressed protein in the form of inclusion bodies 
5 has presented a number of problems. First, although the inclusion bodies contain a large 
percentage of the recombinantly produced protein, additional contaminating proteins must 
be removed in order to isolate the protein of interest. Second, the proteins localized in 
inclusion bodies are in a form that is not biologically active, presumably due to incorrect 
folding. 

10 Several methods have been developed to obtain active proteins from inclusion 

bodies. These strategies include the separation and purification of inclusion bodies from 
other cellular components, solubilization and reduction of the insoluble material, 
purification of solubilized proteins and ultimately renaturation of the proteins and 
generation of native disulfide bonds. The art teaches that concentrations of 6 M or greater 

15 of chaotropic agents, such as guanidine hydrochloride, guanidine isothiocyanate or 

urea-are necessary for solubilization of the insoluble recombinant polypeptides from the 
inclusion bodies. See, for example, Vandenbroeck et al, Eur. J. Biochem. 215:481-486 
(1993); Meagher et al., Biotech. Bioeng. 43:969-977 (1994); Yang et al., U.S. Pat. No. 
4,705,848, issued Nov. 10, 1987; Weir et al., Biochem. J. 245:85-91 (1987); and Fischer, 

20 Biotech. Adv. 12:89-101 (1994). However, the use of high concentration of chaotropic 
agents, such as guanidine hydrochloride, to solubilize proteins denatures the proteins. 

U.S. Patent No. 5,912,327 discloses the use of low concentrations of guanidine 
salts, about 0.7 to about 3.5 M, to solubilize biologically active (i.e., correctly folded) 
proteins and extract this population of the protein from a heterogenous protein mixture 

25 localized in inclusion bodies. The method described in the patent comprises releasing the 
inclusion bodies containing the target protein from the cells by lysis, optionally washing 
the cells to remove cellular components, extracting with solutions containing low 
concentrations of guanidine salts, refolding target proteins which have been solubilized 
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using guanidine salts by rapid dilution of guanidine salt extracts and optionally employing 
agents which facilitate target protein refolding. The protein can then be recovered and 
purified by methods well known to the skilled artisan. However, this method is labor 
intensive. 

5 Tissue plasminogen activator (tPA or TP A) is one example of a pharmaceutical^ 

important drug produced by recombinant methods. Unfortunately the current methods for 
producing tPA from bacterial cell culture are both costly and laborious. The production of 
tPA in heterologous host organisms relies on the production of inactive tPA 
intracellularly in inclusion bodies, and the subsequent isolation and purification of such 

10 inclusion bodies, followed by activation of the tPA once freed from the inclusion bodies. 
U.S. Pat. No. 5,077,392 discloses a renaturation method for refolding denatured proteins 
obtained after expression in inclusion bodies. tPA is isolated as a denatured reduced 
protein and on subsequent oxidation refolded under oxidizing conditions to obtain what 
was reported as up to a 26% yield of "reactivated" protein. While the method appeared to 

15 improve polypeptide yield, the process involves multiple, time-consuming steps, due to 
the initial recovery of the insoluble, inactive protein. 

Purification of 27 kDa NIa Proteinase. The 27 kDaNIA proteinase has been 
especially difficult to isolate and purify in large quantities and in active form because of 

20 its proclivity to form inclusion bodies in nature. Previously published purification 

protocols of TEV nuclear inclusion bodies from infected plant tissue have demonstrated 
considerable proteolytic activity (Dougherty et aL, 1980). However, attempts to separate 
the 49 kDa NIa proteinase from the Nib protein and other components and to purify the 
NIa proteinase have resulted in loss of protein activity (Parks et aL, 1995). 

25 Parks et aL (1995) describe purification of the soluble fraction of recombinantly 

produced 27 kDa NIa proteinase. The purification method of Parks et aL involves 
overexpressing the recombinant form of the proteinase as a fusion protein comprising a 
seven-His tag at the N-terminus and purifying the fusion protein using two separate 
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columns, a nickel-nitrilotriacetic acid-agarose (Ni-agarose) column and a cation-exchange 
column. This method is labor-intensive and produces insufficient quantities of proteinase 
for use as a general tool in protein purification. 

Johnston et al 9 U.S. Patent 5,532,142, discloses recombinant vectors for 
5 overproducing plant virus proteinases in suitable hosts. Johnston et ah use the same 
purification protocol as that of Parks et al (1995) to purify the 27 kDa NIa proteinase. 
The yield of purified proteinase is typically in the range of 5 mg/liter of cell culture and 
not all of it is active. Thus, the yield of active protein is very low. 

As discussed above, the 27 kDa NIA proteinase contains an internal self-cleavage 
10 site that when cleaved, produces a proteinase with reduced substrate cleavage activity. At 
the present, there is no known method of stabilizing the proteinase, and there is no known 
method of obtaining large quantities of purified active 27 kDa NIa proteinase in large 
quantities. Accordingly, there is a need to develop a method of obtaining large quantities 
of purified active 27 kDa proteinase that will not cleave itself. 

15 

SUMMARY OF THE INVENTION 

The present invention provides mutant proteinases having a molecular weight of 
about 27 kDa and reduced self-cleavage activity relative to the self-cleavage activity of its 
wild-type proteinase. In one embodiment, the mutant proteinases of the present invention 
20 comprise an amino acid sequence in which the residue corresponding to Ser 219 of the 
wild-type 27 kDa NIa proteinase is replaced with another residue. In another 
embodiment, the mutant proteinases of the present invention comprises an amino acid 
sequence in which the residue corresponding to Ser 219 is replaced with Asn. 

The present invention also provides composition comprising the mutant 
25 proteinase. Preferably, the composition comprises a carrier in addition to the mutant. 

Moreover, the present invention provides fusion proteins comprising mutant 
proteinase having a molecular weight of about 27 kDa and reduced self-cleavage activity, 
fused to a heterologous polypeptide, fusion partner, or carrier protein. In one 
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embodiment, the heterologous polypeptide, fusion partner, or carrier protein comprises a 
protein that facilitates its isolation. In a preferred embodiment, the heterologous 
polypeptide consists of six histidines. 

The present invention includes nucleic acid molecules comprising a sequence 
5 encoding a mutant proteinase having a molecular weight of about 27 kDa and reduced 
self-cleavage activity relative to the self cleavage activity of to its wild-type proteinase. 
In one embodiment, the nucleic acid molecules encode mutant proteinases comprising an 
amino acid sequence in which the residue corresponding to Ser 219 of the 27 kDa Nla 
proteinase is replaced with another amino acid. In a preferred embodiment, the nucleic 
10 acid molecules encodes mutant proteinases comprising an amino acid sequence in which 
the residue corresponding to Ser 219 is replaced with Asn. 

The present invention also includes vectors, expression vectors, and host cells 
comprising a nucleic acid molecule encoding a mutant proteinase having a molecular 
weight of about 27 kDa and reduced self-cleavage activity relative to the self-cleavage 
15 activity of its wild-type proteinase. 

Further, the present invention includes nucleic acid molecules encoding fusion 
proteins comprising a proteinase, having a molecular weight of about 27 kDA and 
reduced self-cleavage activity relative to the self cleavage activity of its wild-type 
proteinase, fused to a heterologous protein. 
20 The present invention provides methods of producing a proteinase having a 

molecular weight of about 27 kDa and reduced self-cleavage relative to the self-cleavage 
activity of its wild-type proteinase comprising cultivating a host cell comprising a nucleic 
acid encoding the proteinase under conditions that allow expression of the proteinase. 

The present invention also provides a method of purifying a polypeptide that 
25 forms inclusion bodies in a cell comprising: 

a) obtaining cells expressing the polypeptide; 

b) lysing the cells; 

c) pelleting inclusion bodies by centrifugation of the lysed cells; 
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d) 
e) 



centrifuging the inclusion bodies; 

solubilizing the inclusion bodies in buffer containing solubilization 



agent; 



g) 
h) 

i) 



centrifuging the solubilized inclusion bodies to obtain a supernatant 
containing the polypeptide; 

loading the supernatant over a single denaturing column; 
collecting the proteinase; and 
renaturing the collected polypeptide. 



10 



15 



20 



In one embodiment, the polypeptide to be purified is a proteinase having a 
molecular weight of about 27 kDa and reduced self-cleavage activity relative to the self- 
cleavage activity of its wild-type proteinase. In another embodiment, the polypeptide is 
selected from the group consisting of tobacco etch virus (TEV) 27 kDa Nla proteinase 
and mutant 27 kDa Nla proteinase. In a preferred embodiment, the polypeptide is the 
mutant TPSN 27 kDa Nla proteinase, wherein the residue corresponding to Ser 219 of the 
wild-type 27 kDa Nla proteinase is replaced with Asn. 

Contemplated denaturing column of step (g) includes column containing Ni-NTA 
resin. Contemplated methods of lysing the cells include freeze-thaw cyles, sonication, 
and other enzymatic and mechanical means. The present method also contemplates the 
addition of proteinase inhibitors, preferably but not limited to PMSF, leupeptin, and 
pepstatin A, to the cells before lysis. The present invention includes the use of buffers, 
solubilizing agent, renaturing agents well-known to the skilled artisan for purifying 
proteins. 

Additionally, contemplated methods of using the proteinase having a molecular 
weight of 27 kDa and reduced self-cleavage activity, include cleavage of substrate. In one 
embodiment of using the proteinase, the proteinase is incubated with a protein for a 
sufficient amount of time to allow cleavage of the protein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows 27 kDa NIa proteinase purification— soluble vs insoluble preps. 
Samples from the soluble and insoluble purifications of TPWT (wild-type 27 kDa NIa 
proteinase) and TPSN (mutant form with Ser-Asn mutation) show that most of the 27 
kDa NIa proteinase was expressed in the insoluble fraction. Comparison of the final 
products from the different purifications shows that insoluble preparations produce 
greater yields with higher purity than soluble preparations. Overloaded, renatured TPWT 
and TPSN samples demonstrate sample purity as compared to a commercially available 
27 kDa NIa proteinase. 

Figure 2 shows specific activity for substrate cleavage by 27 kDa NIa proteinase. 
Specific activity is defined as picomoles substrate cleaved per picomole 27 kDa NIa 
proteinase in 1 hour at 30 °C. TPWT and TPSN were compared to a commercially 
available 27 kDa NIa proteinase (GibcoBRL) at several substrate concentrations. The 
specific activity appears to be comparable for all samples at the substrate concentrations 
tested. 

Figure 3 shows self-cleavage of 27 kDa NIa proteinase after a 5.5 week 
incubation. TPWT, TPSN, and a commercially available 27 kDa NIa proteinase 
(GibcoBRL) were incubated at various temperatures for 5.5 weeks at a concentration of 
approximately 187.5 g proteinase/ml (micrograms proteinase/ml) reaction. Comparison 
of the 4°C and -20 °C truncated 27 kDa NIa proteinase bands to those in the -80 °C 
samples indicates that self-cleavage activity is present in all samples at 4°C, but is absent 
from the TPSN sample at -20°C. 

Figures 4A and 4B show self-cleavage of 27 kDa NIa proteinase at 4°C and - 

20°C. 



WO 01/96539 



11 



PCT/US01/18620 



TPWT, TPSN, and a commercially available 27 kDa NIa proteinase(GibcoBRL) were 
incubated at various temperatures for 5.5 weeks. Aliquots removed at the timepoints 
shown were run on SDS-PAGE gels and quantitated using a gel documentation system. 
Data were reported as the percent intensity of the truncated 27 kDa NIa proteinase band 
5 compared to the total intensity of both the full-length and truncated bands. The graphs 
show that TPWT has comparable self-cleavage activity to the commercially available 27 
kDa NIa proteinase (assumed to have a wild type self-cleavage site) at both 4°C and 
-20 °C, while TPSN has reduced self-cleavage activity at 4°C and no self-cleavage 
activity at -20°C. 

10 

DETAILED DESCRIPTION OF THE INVENTION 
A. General Description 

The present invention is based on the unexpected discovery that a single amino 
acid change in the internal self-cleavage site will inhibit self-cleavage activity of the 27 
1 5 kDa NIa proteinase. Mutating Ser at position 2 1 9 of the wild-type 27 kDa NIa proteinase 
to Asn prevents the proteinase from cleaving itself. 

The present invention relates to expression, overproduction, and purification of 
virus proteinase. Sources of virus proteinase include flavi-, picorna- and potyviruses. 
The present method is considered particularly suited for overproducing potyvirus 
20 proteinases in E. coli or yeast cells. 

The present invention is based on the development of a single-column purification 
method for isolation of milligram quantities of >95% pure, active 27 kDa NIa proteinase 
from inclusion body preparation. The newly developed purification method yields about 
32 mg of total pure protein from the inclusion body preparation and greater than about 12 
25 mg is active. 

Accordingly, the present invention provides a method of obtaining large quantities 
of active 27 kDa NIa proteinase from inclusion body preparation. The present invention 
also provides a mutant 27 kDa NIa proteinase having the same substrate cleavage as the 
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wild-type 27 kDa NIa proteinase and a reduced self-cleavage activity. Thus, the present 
invention provides a 27 kDa NIa proteinase with increased shelf-life. 

B. Definitions 

5 Unless defined otherwise, all technical and scientific terms used in this 

specification shall have the same meaning as commonly understood by persons of 
ordinary skill in the art to which the present invention pertains. 

As used herein, "carrier" in a composition refers to a diluent, adjuvant, excipient, 
or vehicle with which the product is mixed. 
10 As used herein, "carrier peptide or protein", "fusion partner", or "heterologous 

protein or polypeptide" of a fusion protein refers to the portion of the fusion protein that is 
added to the protein of interest for the purpose of purification, for stability in production, 
or for other reasons. 

As used herein, "control sequence or element" or "regulatory sequence or element" 
15 refers to those non-translated regions of the vector, such as enhancers, promoters, 5' and 3 ? 
untranslated region, which interact with host cellular proteins to carry out transcription 
and translation. 

As used herein, "homologs" refers to proteins having the same or similar 
functions, especially proteins from different species having the same or similar functions. 
20 As used herein, "inclusion body" refers to distinctive structures frequently formed 

in the nucleus or cytoplasm in cells infected with certain filtrable viruses. They may be 
demonstrated by means of various stains. 

As used herein, "internal self-cleavage activity" or "internal autocatalytic activity" 
of a protein refers to the activity of cleaving at some site within the protein itself. 
25 As used herein, "internal self-cleavage site" or "internal autocatalytic site" is the 

site where the protein cleaves itself. 

As used herein, "isolated nucleic acid" refers to a nucleic acid that has been 
separated from its naturally occurring environment. 
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As used herein, "isolated polypeptide or protein" refers to a polypeptide or protein 
that has been separated from its naturally occurring environment. 

As used herein, "proteinase" and "protease" are interchangeable terms and refer to 
enzymes that hydrolyze (break) polypeptide chains. 
5 As used herein, "mutant proteinase" is a proteinase obtained by altering the nucleic 

acid encoding the wild-type proteinase and expressing the altered nucleic acid. A 
"mutant" is a phenotype in which a mutation is manifested. A "mutation" is a change in 
the chemistry of a nucleic acid that is perpetuated in subsequent divisions of the cell in 
which it occurs. 

10 As used herein, "specific activity" is defined as picomoles of substrate cleaved per 

picomole of proteinase in one hour at 30 °C. 

As used herein, "substrate cleavage activity" refers to the activity of a proteinase in 
cleaving a specific amino acid sequence with a specific amount of activity. 

As used herein, "wild-type proteinase" is a naturally occurring proteinase. 
15 As used herein, "27 kDa NIa proteinase or protease" refers to a proteinase having a 

molecular weight of about 27 kDa and having the same substrate cleavage activity as the 
wild-type 27 kDa Ma proteinase obtained from tobacco etch virus. 

C. Specific Embodiments 

20 1 . Nucleic Acids Enco ding Proteinases 

The present invention provides nucleic acid molecules encoding proteinases 
having the same substrate cleavage activity as the wild-type 27 kDa NIa proteinase, and 
preferably the nucleic acid molecules are in isolated form. In one embodiment, nucleic 
acid molecules provided by the present invention encode a proteinase having 27 kDa and 

25 reduced self-cleavage activity as compared to the self-cleavage activity of its wild-type 
proteinase. In another embodiment, nucleic acid molecules of the present invention 
encode a mutant form of the 27 kDa NIa proteinase comprising an amino acid substitution 
corresponding to position 219 of its wild-type proteinase, preferably Ser substituted with 
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Asn, having the same substrate cleavage activity as the wild-type proteinase and having 
reduced self-cleavage activity relative to its wild-type proteinase. 

The nucleic acid molecules of the invention include deoxyribonucleic acids 
(DNAs), both single- and double-stranded deoxyribonucleic acids. However, they can 
also be ribonucleic acids (RNAs), as well as hybrid RNArDNA double-stranded 
molecules. Contemplated nucleic acid molecules also include genomic DNA, cDNA, 
mRNA, and antisense molecules. The nucleic acids molecules of the present invention 
also include native or synthetic, RNA, DNA, or cDNA, that encode a proteinase protein, 
or the complementary strand thereof, including but not limited to nucleic acid found in a 
proteinase expressing organism, such as the tobacco etch virus. 

The nucleic acid sequence encoding the proteinase can be, for instance, 
substantially or folly synthetic. See, for example, Goeddel et al. 9 Proc. Natl. Acad. Sci. 
USA, 76, 106-1 10, 1979. For recombinant expression purposes, codon usage preferences 
for the organism in which such a nucleic acid is to be expressed are advantageously 
considered in designing a synthetic proteinase-encoding nucleic acid. Codon usage 
preferences for different organisms are well known to the skilled artisan. Since the 
nucleic acid code is degenerate, numerous nucleic acid sequences can be used to create 
the same amino acid sequence. 

The nucleic acid molecules of the present invention can encode proteinases having 
the same substrate cleavage activity as a wild-type 27 kDa NIa proteinase. For example, 
the nucleic acid molecules can encode a proteinase having 27 kDa and reduced self- 
cleavage activity or a mutant form of the 27 kDa NIa proteinase having the same substrate 
cleavage activity and having reduced self-cleavage activity as compared with its wild-type 
proteinase, piconaviral 3C proteinases, cellular serine proteinases such as chymotrypsin 
trypsin proteinases, proteinases that have internal self-cleavage sites similar to the internal 
self-cleavage site of the 27 kDa NIa proteinase, mutant forms of such proteinases with 
reduced self-cleavage activity as compared to its wild-type proteinase, and proteinases 
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that recognize the same extended seven amino acid sequence (E-X-X-Y-X-QIS/G) (SEQ 
ID NO: 1) substrate cleavage site as the 27 kDa NIa proteinase. 

In one embodiment, the nucleic acids of the present invention encode a proteinase 
having 27 kDa and reduced self-cleavage activity as compared to its wild-type proteinase. 
5 In another embodiment, the nucleic acids of the present invention encode a mutant form 
of the 27 kDa NIa proteinase comprising an amino acid sequence in which the residue 
corresponding to 219 of the 27 kDa NIa proteinase is replaced with another residue, 
preferably Asn. 

1 0 Nucleic Acid Molecules Encoding Mutant Forms and Allelic Forms of 

Proteinases. To construct mutant forms of proteinases having the same substrate 
cleavage activity as the wild-type 27 kDa NIa proteinase and having reduced self- 
cleavage activity, the nucleic acid encoding the wild-type proteinase can be used as a 
starting point and modified to form the desired mutants. For example, in the preferred 

1 5 embodiment, the nucleic acid sequence encoding the wild-type 27 kDa NIa proteinase is 
mutated such that the Ser corresponding to position 219 in the encoded amino acid 
sequence is replaced with another amino acid, preferably Asn. 

Further, with an altered amino acid sequence, numerous methods are known to 
delete sequence from or mutate nucleic acid sequences that encode a polypeptide and to 

20 confirm the function of the polypeptides encoded by these deleted or mutated sequences. 
Accordingly, the invention also relates to a mutated or deleted version of a proteinase 
nucleic acid that encodes a proteinase that has the same substrate cleavage activity as the 
wild-type 27 kDa NIa proteinase. 

Conservative variants of the wild-type 27 kDa NIa proteinases or its naturally 

25 occurring isoforms and homologs are encompassed by the present invention. Such 

conservative mutations include mutations that switch one amino acid for another within 
one of the following groups: 
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1. Small aliphatic, nonpolar or slightly polar residues: Ala, Ser, Thr, Pro and Gly; 

2. Polar, negatively charged residues and their amides: Asp, Asn, Glu and Gin; 

3. Polar, positively charged residues: His, Arg and Lys; 

4. Large aliphatic, nonpolar residues: Met, Leu, lie, Val and Cys; and 
5 5. Aromatic residues: Phe, Tyr and Trp. 

The types of substitutions selected may be based on the analysis of the frequencies of 
amino acid substitutions between homologous proteins of different species developed by 
Schulz et aL, Principles of Protein Structure, Springer-Verlag, 1978, pp. 14-16, on the 

10 analysis of structure-forming potentials developed by Chou and Fasman, Biochemistry 
13, 21 1, 1974 or other such methods reviewed by Schulz et al y Principles in Protein 
Structure, Springer-Verlag, 1978, pp. 108-130, and on the analysis of hydrophobicity 
patterns in proteins developed by Kyte and Doolittle, J. Mol. Biol. 157: 105-132, 1982. 
The present invention also contemplates nucleic acids encoding naturally 

1 5 occurring allelic variants of the proteinases described above. In a preferred embodiment, 
allelic variants even though possessing a slightly different amino acid sequence than the 
naturally occurring wild-type 27 kDa proteinase will have the requisite ability to 
recognize and cleave the heptapeptide sequence E-X-X-Y-X-Q i S/G (SEQ ID NO: 1). 
The present invention also contemplates conservative variants that do not affect the ability 

20 of the proteinase to recognize and cleave the heptapeptide sequence E-X-X-Y-X-Qi S/G 
(SEQ ID NO: 1). The present invention includes 27 kDa NIa proteinase with altered 
overall charge, structure, hydrophobic/hydrophilic properties by amino acid substitutions, 
insertions, or deletions but still possess the ability to recognize and cleave the heptapetide. 
Preferably, the nucleic acids will encode proteinases having the same substrate 

25 cleavage activity as the wild-type 27 kDa NIa proteinase and having at least about 70% 
sequence identity, more preferably, at least about 80% sequence identity, even more 
preferably, at least about 85% sequence identity, yet more preferably at least about 90% 
sequence identity, and most preferably at least about 95% sequence identity to a wild-type 
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27 kDa NIa proteinase or other naturally occurring isoforms having the same substrate 
cleavage activity. 

Numerous methods for determining percent homology are known in the art. One 
preferred method is to use version 6.0 of the GAP computer program for making sequence 

5 comparisons. The program is available from the University of Wisconsin Genetics 

Computer Group and utilizes the alignment method of Needleman and Wunsch, J. Mol. 
Biol. 48, 443, 1970, as revised by Smith and Waterman Adv. Appl. Math. 2, 482, 1981. 
Numerous methods for determining percent identity are also known in the art, and a 
preferred method is to use the FASTA computer program, which is also available from 

10 the University of Wisconsin Genetics Computer Group. 

Additionally, the invention includes substantially pure nucleic acids that hybridize 
under stringent conditions to a nucleic acid encoding a proteinase having the same 
substrate cleavage activity as the wild-type 27 kDa NIa proteinase. Stringent 
hybridization conditions are conditions in which hybridization to a labeled known nucleic 

15 acid sequence yields a clear and detectable sequence. Stringent conditions are those that 
(1) employ low ionic strength and high temperature for washing, for example, with 0.015 
M NaCl, 0.0015 M sodium titrate, 0.1% SDS at 50°C; (2) employ during hybridization 
a denaturing agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% 
bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium 

20 phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42°C; and (3) 
employ 50% formamide, 5X SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5X Denhardt's solution, sonicated 
salmon sperm DNA (50 (ig/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, followed 
with washes at 42°C in 0.2X SSC and 0.1% SDS. A skilled artisan can readily determine 

25 and vary the stringency conditions appropriately to obtain a clear and detectable 
hybridization signal. 
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Nucleic Acid Molecules Encoding Fusion Proteins, The nucleic acid molecules of 
the present invention also encode fusion proteins comprising a proteinase such as those 
described in the previous paragraphs fused to a heterologous protein or polypeptide. In 
one embodiment, the fusion proteins of the present invention comprise the proteinase 
having 27 kDa and reduced self-cleavage activity as compared to its wild-type proteinase 
fused to a heterologous protein. In another embodiment, the fusion protein comprises the 
mutant 27 kDa NIa proteinase comprising an amino acid sequence in which the residue 
corresponding to Ser 219 of the 27 kDa NIa proteinase is replaced with another residue, 
preferably Asn, fused to a heterologous protein. 

Nucleic acids encoding various heterologous proteins may be fused to the nucleic 
acid encoding the proteinase. For example, nucleic acid encoding affinity tags such as the 
His tags, antibodies, or carrier peptides or proteins having specific binding properties such 
as Staphylococcus aureus protein A and the carbohydrate recognition domain (CRD) of 
galactose-specific rat hepatic lectin (Taylor and Drickamer, 1991), may be fused to the 
proteinase for purification of the proteinase by affinity chromatography. Nucleic acid 
encoding substrate cleavage sites of thrombin or factor X may be fused to the nucleic acid 
encoding the proteinase for ease of removal of carrier peptide. Additionally, fusion 
proteins tend to be more soluble than a single protein, contributing to higher yields and 
simpler purification. The fusion partner for the proteinase may be selected on the basis of 
transport characteristics to assure that the fusion protein is secreted into either the 
periplasmic space or the growth medium. Moreover, the fusion partner may also be 
selected for increasing the stability of the proteinase in the preferred expression system 
for obtaining large quantities of the protein. For example, fusion proteins are usually 
more stable in bacteria than the native eukaryotic proteinase. 

Nucleic Acid Encoding Proteins or Polypeptides Expressed in Inclusion Bodies. 
Nucleic acids encoding proteins, polypeptides, or fusion proteins or polypeptides that 
form inclusion bodies in cells are also encompassed by the present application. Such 
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nucleic acids can be expressed in host cells to produce large quantities of the proteins or 
polypeptides in inclusion bodies, as discussed below. The proteins or polypeptides are 
then purified by the purification method provided by the present invention, also discussed 
below. 

5 

2. Polype ptides or Proteins 

Polypeptides of the invention include all proteinases having the same substrate 
cleavage activity as the wild-type 27 kDa NIa proteinase, and preferably in isolated or 
purified form. The present invention also includes these proteinases in native or synthetic 
1 0 form, including but not limited to polypeptides purified from a proteinase-expressing 
organism. In one embodiment, the polypeptides of the present invention comprise a 
proteinase having 27 kDa and reduced self-cleavage activity as compared to its wild-type 
proteinase. In another embodiment, the polypepetides of the present invention comprise a 
mutant form of the wild-type 27 kDa NIa proteinase having the same substrate cleavage 
1 5 activity as the wild-type proteinase, but with reduced self-cleavage activity as compared 
to its wild-type proteinase, and having amino acid corresponding to position 2i9 of the 
wild-type NIa proteinase substituted with another amino acid, preferably Asn. 

Structurally, the 27 kDa NIa proteinase has been reported to be similar to the 
trypsin-like family of cellular serine proteinases, such as chymotrypsin or trypsin, with 
20 the substitution of Cys for serine as the active site nucleophile (Blazan et al, 1990; 

Dougherty et al, 1989). Dougherty et al (1989) discloses the catalytic triad of 27 kDa 
NIa proteinase to be composed of His, Asp, and Cys which is similar to the catalytic triad 
found in other viral proteinases (Dougherty et al, 1989). However, unlike the other 
proteinases, the 27 kDa proteinase recognizes an extended heptapeptide sequence, E-X-X- 
25 Y-X-Q i S/G (positions P6-P1 1 P* 1 ; X is any amino acid) (SEQ ID NO: 1), and cleaves 
within the heptapeptide sequence (Dougherty et al, 1989a; Dougherty et al, 1988; 
Dougherty et al, 1989b). 
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Proteinases having the same substrate cleavage activity as a wild-type 27 kDa NIa 
proteinase include but are not limited to polypeptides comprising the wild-type 27 kDa 
NIa proteinase or a mutant form thereof having the same substrate cleavage activity and 
having reduced self-cleavage activity as compared with its wild-type proteinase, 
piconaviral 3C proteinases, cellular serine proteinases such as chymotrypsin trypsin 
proteinases, proteinases that have internal self-cleavage sites similar to the internal self- 
cleavage site of the 27 kDa NIa proteinase, mutant forms of such proteinases with reduced 
self-cleavage activity as compared to its wild-type proteinase, and proteinases that 
recognize the same extended seven amino acid sequence (E-X-X-Y-X-QiS/G, SEQ ID 
NO: 1) substrate cleavage site as the 27 kDa NIa proteinase. 

In one embodiment, polypeptides of the present invention comprise a proteinase 
having a molecular weight of 27 kDa and reduced self-cleavage activity as compared to 
the wild-type 27 kDa NIa proteinase. In another embodiment, the polypeptides of the 
present invention comprise the mutant form of the 27 kDa NIa proteinase having an 

amino acid sequence in which the residue corresponding to Ser219 of the 27 kDa NIa 

i 

proteinase is replaced with another residue, preferably Asn. 

Mutant Forms and Allelic Forms of Proteinases. The present invention also 
include mutant forms of proteinases having the same substrate cleavage activity as the 
wild-type 27 kDa NIa proteinase and having reduced self-cleavage activity. As discussed 
above, to construct mutant forms of proteinases, the nucleic acid encoding the wild-type 
proteinase can be used as a starting point and modified to form the desired mutants. For 
example, in the preferred embodiment, the nucleic acid sequence encoding the wild-type 
27 kDa NIa proteinase is mutated such that Ser corresponding to position 219 in the 
encoded amino acid sequence is replaced with another amino acid, preferably Asn. 
Further, numerous methods are known to delete and mutate nucleic acid sequences that 
encode a polypeptide and' to confirm the function of the polypeptides encoded by these 
deleted or mutated sequences. Accordingly, the invention also provides mutated or 
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deleted version of a proteinase that has the same substrate cleavage activity as the wild- 
type 27 kDa NIa proteinase. 

Conservative variants of the wild-type 27 kDa NIa proteinases or its naturally 
occurring isoforms and homologs are encompassed. Such conservative mutations have 
5 been discussed under the previous section. The present invention also contemplates 
conservative variants that do not affect the ability of the proteinase to recognize and 
cleave the heptapeptide sequence E-X-X-Y-X-Ql S/G (SEQ ID NO: 1). The present 
invention includes 27 kDa NIa proteinase with altered overall charge, structure, 
hydrophobic/hydrophilic properties by amino acid substitutions, insertions, or deletions 
1 0 but still possess the ability to recognize and cleave the heptapeptide. 

The present invention also contemplates naturally occurring allelic variants of the 
proteinases having the same substrate cleavage activity as the 27 kDa NIa proteinase. In a 
preferred embodiment, allelic variants even though possessing a slightly different amino 
acid sequence than the naturally occurring wild-type 27 kDa proteinase will have the 
1 5 requisite ability to recognize and cleave the heptapeptide sequence E-X-X-Y-X-Q 1 S/G 
(SEQ ID NO: 1). 

Preferably, proteinases having the same substrate cleavage activity as the wild- 
type 27 kDa Ma proteinase and at least about 70% sequence identity, more preferably, at 
least about 80% sequence identity, even more preferably, at least about 85% sequence 
20 identity, yet more preferably at least about 90% sequence identity, and most preferably at 
least about 95% sequence identity to a wild-type 27 kDa NIa proteinase or other naturally 
occurring isoforms having the same substrate cleavage activity. 

Numerous methods for determining percent homology are known in the art. One 
preferred method is to use version 6.0 of the GAP computer program for making sequence 
25 comparisons. The program is available from the University of Wisconsin Genetics 

Computer Group and utilizes the alignment method of Needleman and Wunsch, J. Mol. 
Biol. 48, 443, 1970, as revised by Smith and Waterman Adv. Appl. Math. 2, 482, 1981. 
Numerous methods for determining percent identity are also known in the art, and a 
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preferred method is to use the FASTA computer program, which is also available from 
the University of Wisconsin Genetics Computer Group. 

Fusion Proteins. The present invention also provides fusion proteins comprising a 
5 proteinase having the same substrate cleavage activity as the 27 kDa NIa proteinase fused 
to a heterologous protein or polypeptide. In one embodiment, the fusion proteins of the 
present invention comprise a proteinase having a molecular weight of about 27 kDa and 
reduced self-clevage activity as compared to its wild-type proteinase fused to a 
heterologous protein. In another embodiment, the fusion proteins of the present invention 
1 0 comprise a mutant 27 kDa NIa proteinase having an amino acid sequence in which the 
residue corresponding to Ser219 of the 27 kDa NIa proteinase is replaced with another 
residue, preferably Asn, fused to a heterologous protein. 

As discussed earlier, various heterologous proteins may be fused to the proteinase 
of the present invention (see below also). 

Compositions. The present invention also provides compositions comprising an 
isolated proteinase having the same substrate cleavage activity as the wild-type 27 kDa 
NIa proteinase and a carrier. The composition may comprise a dry formulation or an 
aqueous solution. The carrier may be any compound that does not affect the substrate 
20 cleavage activity of the proteinase. Carrier could be a diluent, an excipient, or even a 

stabilizer. A specific example of a carrier could be buffer or water, which does not affect 
the stability of the proteinase. 

Uses of Proteinases with Identical Substrate Cleavage Activity as the 27 kDa NIa 
25 Proteinase. The present invention also provides methods of using proteinases with the 
same substrate cleavage activity as the 27 kDa NIa proteinase. Proteinases of the present 
invention can be used to cleave polypeptides comprising the heptapeptide sequence E-X- 
X-Y-X-Q I S/G (SEQ ID NO: 1). The mutant forms of the 27 kDa NIa proteinase with 
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decreased self cleavage activity are more stable than the wild-type proteinase and have a 
longer shelf-life. 

As discussed earlier, Parks et ah (1994) and Johnson et 0/., U.S. Patent 5,532,142, 
disclose the use of the 27 kDa NIa proteinase as a tool for purifying and obtaining large 
5 quantities of desired proteins. As shown by Parks et ah and Johnson et ah, to obtain large 
quantities of a desired protein, the protein is fused to a carrier protein and a substrate 
cleavage site recognized by the 27 kDa NIa proteinase is inserted between the two 
proteins. The 27 kDa NIa proteinase is selected for separating the carrier protein from the 
desired protein because the 27 kDa NIa proteinase exhibits unique characteristics. Unlike 

10 other proteinases, the 27 kDa proteinase exhibits high specificity, insensitivity to many 

proteinase inhibitors used in protein purification, and efficient cleavage under broad range 
of temperatures (Polayes et ah, 1994). Moreover, the protein of interest can be easily 
separated from the carrier peptide and the 27 kDa proteinase. 

The present invention provides mutant forms of the 27 kDa NIa proteinase with 

15 the same substrate activity as the wild-type proteinase and with decreased self-cleavage 
activity. The mutant 27 kDa NIa proteinases of the present invention are also useful as 
tools for purifying and obtaining large quantities of desired proteins. 

Proteins or Polypeptides that Form Inclusion Bodies. Proteins, polypeptides, 
20 fusion proteins or polypeptides that form inclusion bodies in cells are also encompassed 
by the present application. Such proteins, polypeptides, or fusion proteins, either 
produced by recombinant means or present in their native source, are then purified by the 
purification method provided by the present invention, discussed below. 

In a preferred embodiment, the fusion proteins comprising a protein of interest is 
25 fused to a carrier protein or fusion partner that facilitates its isolation. Examples of carrier 
proteins are not limited to any particular protein, but may be selected from a wide variety 
of proteins such as beta galactosidase, ubiquitin, glutathione S-transferase, alkaline 
phosphatase, maltose binding protein, Protein A, polyhistidines, monoclonal antibody 
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epitopes and so forth. Carrier proteins typically will be selected on the basis of 
characteristics contributing to easy isolation, most desirable being those that are readily 
secreted by the microorganisms or which have some property or feature which facilitates 
isolation and purification of the protein. Glutathione S-transferase, maltose binding 
protein and polyhistidine sequences, for example, are generally preferred because there 
are readily available affinity columns to which they can be bound and eluted. Other 
suitable fusion partners include antigenic tags that readily bind to corresponding 
antibodies or proteins that have special affinity properties, for example, selective binding 
to particular metals, as with polyhistidine peptide binding to nickel. 

3 . Recombinant Production of Proteinases 

Vectors and Expression vectors. The present invention provides vectors and 
expression vectors comprising a nucleic acid encoding a proteinase having the same 
substrate cleavage activity as the wild-type 27 kDa Nla proteinase. In a preferred 
embodiment, the vectors or expression vectors comprise a nucleic acid encoding a 
proteinase having a molecular weight of about 27 kDa and reduced self-cleavage activity 
as compared to its wild-type proteinase. In a more preferred embodiment, the vectors or 
expression vectors comprise the nucleic acid encoding a mutant form of the 27 kDa Nla 
proteinase having the same substrate cleavage activity as its wild-type 27 kDa Nla 
proteinase and reduced self-cleavage activity as compared to its wild-type proteinase. 

The present invention also provides vectors and expression vectors containing the 
nucleic acids encoding fusion proteins and encoding any protein that forms inclusion 
bodies in cells. Preferably, the fusion proteins comprise a proteinase having the same 
substrate cleavage activity as the 27 kDa Nla proteinase and a heterologous protein. More 
preferably, the proteinase of the fusion protein is a mutant form of 27 kDa Nla proteinase 
having reduced self-cleavage activity and has an amino acid sequence in which the 
residue corresponding to position 240 is replaced with another amino acid, preferably 
Asn. 
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Vectors or cassettes useful for the transformation and transfection of suitable host 
cells are well known in the art. Typically, the vectors or cassettes contain sequences 
directing transcription and/or translation of the relevant gene, a selectable marker, and 
sequences allowing autonomous replication or chromosomal integration. In an 
5 autonomously replicating vector, le. 9 a vector which exists as an extrachromosomal 
entity, the replication of which is independent of chromosomal replication, e.g., a 
plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. 
The vector may contain any means for assuring self-replication. Alternatively, the vector 
may be one which, when introduced into the host cell, is integrated into the genome and 
1 0 replicated together with the chromosome(s) into which it has been integrated. 

Furthermore, a single vector or plasmid or two or more vectors or plasmids which 
together contain the total DNA to be introduced into the genome of the host cell, or a 
transposon may be used. 

Suitable vectors for the present invention comprise a region 5 T of the gene which 
15 harbors transcriptional initiation regulation or control and a region 3' of the DNA 

fragment which regulates transcriptional termination. It is most preferred when both 
regulatory regions are derived from nucleic acids homologous to the transformed host 
cell, although it is to be understood that such regulatory regions need not be derived from 
the nucleic acids native to the specific species chosen as a production host. 
20 The "control elements" or "regulatory sequences" are those non-translated regions 

of the vector-enhancers, promoters, 5' and 3 1 untranslated regions-which interact with 
host cellular proteins to carry out transcription and translation. Such elements may vary 
in their strength and specificity. Depending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including constitutive and 
25 inducible promoters, may be used. For example, when cloning in bacterial systems, 
inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid 
(Stratagene, LaJolla, Calif.) or PSPORT1 plasmid (Gibco BRL) and the like may be used. 
The baculovirus polyhedrin promoter may be used in insect cells. Promoters or 
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enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO; and 
storage protein genes) or from plant viruses (e.g., viral promoters or leader sequences) 
may be cloned into the vector. In mammalian cell systems, promoters from mammalian 
genes or from mammalian viruses are preferable. If it is necessary to generate a cell line 
5 that contains multiple copies of the sequence encoding the proteinase or protein formed in 
inclusion bodies, vectors based on SV40 or EBV may be used with an appropriate 
selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending 
upon the use intended for the proteinase or protein that forms inclusion bodies in a cell. 

1 0 For example, when large quantities of the protein are needed for the induction of 

antibodies or for use as a tool in the purification of proteins, vectors which direct high 
level expression of fusion proteins that are readily purified may be used. Such vectors 
include, but are not limited to, the multifunctional E. coli cloning and expression vectors 
such as BLUES CRIPT (Stratagene), in which the sequence encoding the proteinase may 

15 be ligated into the vector in frame with sequences for the amino-terminal Met and the 
subsequent 7 residues of P-galactosidase so that a hybrid protein is produced. An 
example of such a vector include pIN vectors (Van Heeke, G. and S. M. Schuster (1989) 
J. Biol. Chem. 264:5503-5509). 

pGEX vectors (Promega, Madison, Wis.) are used to express foreign polypeptides 

20 as fusion proteins with a heterologous protein such as glutathione S-transferase (GST). In 
general, fusion proteins are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the presence of free 
glutathione. Proteins made in such systems may be designed to include a protease 
cleavage site, such as the substrate cleavage site of the 27 kDaNIa proteinase, so that the 

25 purified polypeptide of interest can be easily released from the GST moiety. The fusion 
proteins may also comprise a preferred proteinase or any protein that forms inclusion 
bodies in a cell and a carrier peptide or protein, such as the His tag, for affinity 
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purification of the proteinase or protein. A further discussion of vectors which contain 
fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 12:441-453). 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
5 be used (Grant et al (1987) Methods Enzymol. 153:516-544). In cases where plant 
expression vectors are used, the expression of sequences encoding the proteinase or 
protein that forms inclusion bodies in a cell may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters of CAMV 
may be used alone or in combination with the omega leader sequence from TMV 

10 (Takamatsu, N. (1987) EMBO J. 6:307-31 1). Alternatively, plant promoters such as the 
small subunit of RUBISCO or heat shock promoters may be used (Coruzzi, G. et al 
(1984) EMBO J. 3:1671-1680; Broglie, R. et al (1984) Science 224:838-843; and Winter, 
J. et al (1991) Results Probi. Cell Differ. 17:85-105). These constructs can be introduced 
into plant cells by direct DNA transformation or pathogen-mediated transfection. Such 

15 techniques are described in a number of generally available reviews (see, for example, 

Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, 
New York, N.Y.; pp. 191-196). 

An insect system may also be used to express the proteinase or protein that forms 
inclusion bodies in a cell. For example, in one such system, Autographa californica 

20 nuclear polyhidrosis virus (AcNPV) is used as a vector to express foreign genes in 
Spodoptera frugiperda cells or in Trichoplusia larvae. The sequences encoding the 
protein may be cloned into a non-essential region of the virus, such as the polyhedrin 
gene, and placed under control of the polyhedrin promoter. Successful insertion of the 
protein will render the polyhedrin gene inactive and produce recombinant virus lacking 

25 coat protein. The recombinant viruses may then be used to infect, for example, S. 

frugiperda cells or Trichoplusia larvae in which PLBP may be expressed (Engelhard, E. 
K. el al (1994) Proc. Nat. Acad. Sci. 91:3224-3227). 
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In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, sequences 
encoding the proteinase or protein that form inclusion bodies in a cell may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and 
5 tripartite leader sequence. Insertion in a non-essential El or E3 region of the viral 
genome may be used to obtain a viable virus which is capable of expressing PLBP in 
infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). 
In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may 
be used to increase expression in mammalian host cells. 

10 Specific initiation signals may also be used to achieve more efficient translation of 

sequences encoding the proteinase or protein formed in inclusion bodies. Such signals 
include the ATG initiation codon and adjacent sequences. In cases where sequences 
encoding a protein, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational regulatory 

15 signals may be needed. However, in cases where only coding sequence, or a fragment 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational elements 
and initiation codons may be of various origins, both natural and synthetic. The 

20 efficiency of expression may be enhanced by the inclusion of enhancers which are 

appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al (1994) Results Probl. Cell Differ. 20:125-162). 

Host Cells. The present invention also provides host cells, comprising a nucleic 
25 acid sequence encoding a proteinase as described above or a protein that forms inclusion 
bodies in a cell, which are used in the recombinant production of the encoding the 
proteinase or protein. A vector comprising the nucleic acid sequence of the present 
invention is introduced into a host cell so that the vector is maintained as a chromosomal 
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integrant or as a self-replicating extra-chromosomal vector as described earlier. The 
choice of a host cell will to a large extent depend upon the gene encoding the polypeptide 
and its source. The host cell may be a unicellular microorganism, e.g., a prokaryote, or a 
non-unicellular microorganism, e.g., a eukaryote. The host cell may be a eukaryote 
5 selected from the group consisting of mammalian cell, insect cell, plant cell or fungal 
cell. 

Useful unicellular cells are bacterial cells such as gram positive bacteria including, 
but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, 
Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus lautus, 

10 Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, 
Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., Streptomyces 
lividans or Streptomyces murinus, or gram negative bacteria -such as E. coli and 
Pseudomonas sp. In a preferred embodiment, the bacterial host cell is E. Coli. Various 
strains of E. coli {e.g., HB101, DH5 , DH10, and MC1061) are well-known as host cells in 

15 the field of biotechnology. 

Mammalian cells, such as Chinese hamster ovary cells (CHO) or 3T3 cells may be 
used in the present invention. The selection of suitable mammalian host cells and 
methods for transformation, culture, amplification, screening and product production and 
purification are known in the art. Other suitable mammalian cell lines, are the monkey 

20 COS-1 and COS-7 cell lines, and the CV-1 cell line. Further exemplary mammalian host 
cells include primate cell lines and rodent cell lines, including transformed cell lines. 
Normal diploid cells, cell strains derived from in vitro culture of primary tissue, as well as 
primary explants, are also suitable. Candidate cells may be genotypically deficient in the 
selection gene, or may contain a dominantly acting selection gene. Other suitable 

25 mammalian cell lines include but are not limited to, HeLa, mouse L-929 cells. 3T3 lines 
derived from Swiss, Balb-c or NIH mice, BHK or HaK hamster cell lines. 

Many strains of yeast cells known to those skilled in the art are also available as 
host cells for expression of the polypeptides of the present invention. Additionally, where 
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desired, insect cells may be utilized as host cells in the method of the present invention 
(Miller et al, 1286 Genetic Engineering 8:277-298). 

The introduction of a vector into a bacterial host cell may, for instance, be effected 
by protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General 
5 Genetics 168: 1 1 1-1 15), using competent cells (see, e.g., Young and Spizizin, 1961, 

Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of 
Molecular Biology 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, 
Biotechniques 6: 742-751), or conjugation (see, e.g., Koehler and Thorne, 1987, Journal 
of Bacteriology 169: 5771-5278). 

10 Host cells transformed with nucleotide sequences encoding proteinase and protein 

that form inclusion bodies may be cultured under conditions suitable for the expression 
and recovery of the proteinase from cell culture. The protein produced by a transformed 
cell may be secreted, contained intracellularly, or contained with the inclusion factor 
depending on the sequence and/or the vector used. As will be understood by those of skill 

15 in the art, expression vectors containing polynucleotides which encode the protein may be 
designed to contain signal sequences which direct secretion of the protein through a 
prokaryotic or eukaryotic cell membrane. 

Other constructions may be used to join sequences encoding the protein to 
nucleotide sequence encoding a polypeptide domain which will facilitate purification of 

20 soluble proteins. Such purification facilitating domains include, but are not limited to, 
metal chelating peptides such as histidine-tryptophan modules that allow purification on 
immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification 
system (hnmunex Corp., Seattle, Wash.). The inclusion of cleavable linker sequences 

25 such as those specific for Factor XA or enterokinase (Invitroger, San Diego, Calif.) or the 
27 kDa NIa proteinase cleavage site between the purification domain and the protein may 
be used to facilitate purification. One such expression vector provides for expression of a 
fusion protein containing the protein and a nucleic acid encoding 6 histidine residues 
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preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate 
purification on IMAC (immobilized metal ion affinity chromatography) as described in 
Porath, J. et al (1992, Prot. Exp. Purif. 3: 263-281) while the enterokinase cleavage site 
provides a means for purifying from the fusion protein. 
5 In addition, a host cell strain may be chosen for its ability to modulate the 

expression of the inserted sequences or to process the expressed protein in the desired 
fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
10 used to facilitate correct insertion, folding and/or function. Different host cells which 
have specific cellular machinery and characteristic mechanisms for post-translational 
activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from the 
American Type Culture Collection (ATCC; 10801 University Boulevard, Manassas, Va., 
201 10-2209) and may be chosen to ensure the correct modification and processing of the 
15 foreign protein. 

For long-term, high-yield production of proteinases or proteins that form inclusion 
bodies in a cell, stable expression is preferred. For example, cell lines which stably 
express proteinase or proteins that form inclusion bodies may be transformed using 
expression vectors which may contain viral origins of replication and/or endogenous 
20 expression elements and a selectable marker gene on the same or on a separate vector. 

Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an 
enriched media before they are switched to selective media. The purpose of the selectable 
marker is to confer resistance to selection, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably 
25 transformed cells may be proliferated using tissue culture techniques appropriate to the 
cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, 
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M. et al (1977) Cell 1 1 :223-32) and adenine phosphoribosyltransferase (Lowy, I. et al 
(1980) Cell 22:817-23) genes which can be employed in tk~ or aprT cells, respectively. 
Also, antimetabolite, antibiotic or herbicide resistance can be used as the basis for 
selection; for example, dhfr which confers resistance to methotrexate (Wigler, M. et aL 
5 (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which confers resistance to the 

aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 
150:1-14); and als or pat, which confers resistance to chlorsulfuron and phosphinotricin 
acetyltransferase, respectively (Murry, supra). Additional selectable genes have been 
described, for example, trpB, which allows cells to utilize indole in place of tryptophan, 
10 or hisD, which allows cells to utilize histinol in place of histidine (Hartman, S. C. and R. 
C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-51). Recently, the use of visible 
markers has gained popularity with such markers as anthocyanins, -glucuronidase and its 
substrate GUS, and luciferase and its substrate luciferin, being widely used not only to 
identify transformants, but also to quantify the amount of transient or stable protein 
15 expression attributable to a specific vector system (Rhodes, C. A. et al (1995) Methods 
Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that the gene 
of interest is also present, its presence and expression may need to be confirmed. For 
example, if the sequence encoding a proteinase or a protein that forms inclusion bodies in 
20 a cell, is inserted within a marker gene sequence, transformed cells containing sequences 
encoding the protein can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding protein 
under the control of a single promoter. Expression of the marker gene in response to 
induction or selection usually indicates expression of the tandem gene as well, 
25 Alternatively, host cells which contain the nucleic acid sequence encoding the protein and 
express the protein may be identified by a variety of procedures known to those of skill in 
the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridizations and substrate cleavage assay or immunoassay techniques which include 
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membrane, solution, or chip based technologies for the detection and/or quantification of 
nucleic acid or protein. 

Production of Protein from Host Cells. The present invention also provides 
5 methods for producing a proteinase or a protein that forms inclusion body in a cell 
comprising (a) cultivating the host cell under conditions that allow expression of the 
protein; and (b) recovering the protein. The proteinase of the present invention has a 
substrate cleavage activity that is identical to that of the 27 kDa NIa proteinase. 
Preferably, the proteinase has a molecular weight of about 27 kDa and a reduced self- 
1 0 cleavage activity as compared to its wild-type proteinase. More preferably, the proteinase 
is a mutant form of the 27 kDa NIa proteinase, and comprises an amino acid sequence in 
which the residue corresponding to position 219 of the wild-type 27 kDa NIa is replaced 
with another amino acid, preferably Asn. 

In the production methods of the present invention, the cells are cultivated in a 
1 5 nutrient medium suitable for production of the proteins of the present invention using 
methods known in the art. For example, the cell may be cultivated by shake-flask 
cultivation, small-scale or large-scale fermentation (including continuous, batch, 
fed-batch, or solid state fermentations) in laboratory or industrial fermenters performed in 
a suitable medium and under conditions allowing the polypeptide to be expressed and/or 
20 isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and 
nitrogen sources and inorganic salts, using procedures known in the art. Suitable media 
are available from commercial suppliers or may be prepared according to published 
compositions (e.g., in catalogues of the American Type Culture Collection). If the protein 
is secreted into the nutrient medium, the polypeptide can be recovered directly from the 
25 medium. If the protein is not secreted, it can be recovered from cell lysates as described 
below 

The proteins may be detected using methods known in the art that are specific for 
the polypeptides. These detection methods may include use of specific antibodies, 
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formation of an enzyme product, or disappearance of an enzyme substrate. For example, 
an enzyme assay may be used to determine the activity of the protein. 

The resulting protein may be recovered by methods described below. 



5 4. Purification of Proteins 

General Procedure for Purification of Proteins. The proteins of the present 
invention produced from host cells may be recovered by methods known in the art. For 
example, the polypeptide may be recovered from the nutrient medium by conventional 
procedures including, but not limited to, centrifugation, filtration, extraction, 
10 spray-drying, evaporation, or precipitation. 

The proteins of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, 
affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures 
(e.g., preparative isoelectric focusing), differential solubility (e.g, ammonium sulfate 
15 precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and 
Lars Ryden, editors, VCH Publishers, New York, 1989). 

In one embodiment of the invention, when the protein is secreted into the culture 
medium, the step of collecting the conditioned culture medium is followed by the step of 
purifying the protein. The protein in the collected medium may be purified by any 
20 method known in the art, such as selective precipitation, affinity chromatography, 
dialysis, immunoprecipitation, ion-exchange chromatography, size-exclusion 
chromatography, hydrophobic interaction chromatography, or reversed-phase 
chromatography (Guide to Protein Purification, in Methods in Enzymology 182 (Murray 
P. Deutscher ed., 1990), Remington: The Science and Practice of Pharmacy 534-58 
25 (Alfonso R. Gennaro ed., 19th ed. 1995)). Chromatography can be carried out using 

conventional columns or by HPLC (high performance liquid chromatography) or FPLC 
(fast protein liquid chromatography). 
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In another embodiment of the invention, the protein is purified by anion-exchange 
chromatography. Methods of purifying proteins using anion-exchange chromatography 
are well known to those skilled in the art (See, e.g., Edward R. Rossomando, 
Ion-Exchange Chromatography, in Guide to Protein Purification, Methods of 
5 Enzymology 182, 309-16 (Murray P. Deutscher ed., 1990)). In an alternate embodiment 
of the invention, the protein is purified by affinity chromatography over a 
protamine-agarose column, such as a protamine-Sepharose® (Pharmacia-LKB) column. 
Methods of purifying the proteins of the present invention using protamine-agarose 
columns are known to those skilled in the art (Dempfle and Heene, Thromb. Res. 46, 19 
10 (1 987)). Preferably, the proteins of the present invention are purified by affinity 
chromatography using a column containing Ni-NTA Superflow resin (nickel- 
nitilotriacetic acid-agarose, Qiagen) 

In an alternate embodiment of the invention, the protein is purified by 
immunoaffmity chromatography using polyclonal or monoclonal antibodies raised against 
1 5 the intact protein or peptides thereof of the present invention. Methods of producing and 
using polyclonal and monoclonal antibodies are well known in the art (Ed Harlow & 
David Lane, Antibodies: A Laboratory Manual (1988); Norman A. Staines, Monoclonal 
Antibodies, in Biochemical Research Techniques: A Practical Introduction (John M. 
Wrigglesworth Ed., 1983)). Likewise, methods of performing affinity chromatography 
20 using polyclonal and monoclonal antibodies are also well known in the art. (Ed Harlow 
& David Lane, Antibodies: A Laboratory Manual (1988); Steven Ostrove, Affinity 
Chromatography. General Methods, in Guide to Protein Purification, Methods of 
Enzymology 182, 357-71 (Murray P. Deutscher ed., 1990)). 

In another embodiment of the invention, when the protein is not secreted into the 
25 culture medium, it is necessary to remove the protein from the cell by lysing the cells 
using methods well known in the art such as sonication or freeze-thawing, followed by 
isolating the protein from the cell extract. The cell extract may be prepared or incubated 
in the presence of cell protease inhibitors. The plant virus cleavage site is not susceptible 
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to proteolysis by ordinary cell proteases. The plant virus proteinase itself is unaffected by 
ordinary protease inhibitors so that such inhibitors may be added in amounts sufficient to 
inactivate cell proteases. Typical and commonly used cell protease inhibitors include 
leupeptin, pepstatin A, PMSF, E-64, TLCK, bestatin and aprotinin. However, any of a 
5 number of proteinase inhibitors may be employed so long as they are not inhibitors of the 
proteinase used to release a foreign protein from the carrier protein. The practitioner will 
.typically culture in a media or grow in an environment suitable for the host selected, 
prepare cell extract, then add appropriate cell protease inhibitors. The desired protein 
may be purified using standard procedures such as chromatography, electrophoresis or 
1 0 density gradient centrifugation. 

The above methods are also applicable to purifying proteins from their native 

source. 

Procedure for Purification of Proteins that Form Inclusion bodies. U.S. Pat. No. 

1 5 5,989,554 provides a general method for isolating and purifying proteins that form 

inclusion bodies in cells. First, the cells are lysed by enzymatic or mechanical means in a 
buffer. The preferred method is sonication, although any other lysis method will work, as 
long as lysis is complete and DNA and RNA are sufficiently fragmented so as not to 
pellet upon centrifugation. Preferred buffers contain Tris buffer at pH 7-8, isotonic saline, 

20 and dithiothreitol (DTT) to maintain all cell proteins in a reduced state. After sonication, 
detergent is added to the mixture to solubilize most lipids and proteins, and the mixture is 
centrifuged; it is preferred to use a centrifuge speed of greater than 10,000g for 10 
minutes. The desired protein is then found in the pellet fraction at a high degree of purity. 
Higher purity is usually obtained by washing the pellet in a second wash solution, 

25 often containing a different agent or detergent. Washing is accomplished by resuspending 
the pellet in the fresh buffer followed by centrifugation as above. The preferred first 
detergent is sodium deoxycholate (NaDOC), and the second preferred detergent is Triton 
X-100. 
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After the detergent washes, the pellet can be washed either with the above buffer 
or with phosphate-buffered saline (PBS) to remove trace detergent, then resuspended in a 
volume of a desired buffer for storage or use in any of the ways described elsewhere, 
including immunization. 
5 The proteins purified by the above method are ready for use in any or all of the 

applications contemplated in the invention, including but not limited to the following; 
immunization of animals, use as an adjuvant, coupling to other ligands, use as a 
protease inhibitor, immobilization on hydrophobic surfaces, use as an enzyme substrate, 
and use in peptide production after cleavage. 
10 As discussed earlier, this method is labor-intensive and does not provide large 

quantities of active protein. 

Novel Method for Purifying Proteins that Form Inclusion Bodies. The present 
invention is based on the development of a single-column purification method for 
1 5 preparing milligram quantities of >95% pure, active 27 kDa NIa proteinase. The method 
comprises the following steps: 

a) obtaining cells expressing the polypeptide; 

b) lysing the cells; 

c) pelleting inclusion bodies by centrifugation of the lysed cells; 
20 d) centrifuging the inclusion bodies; 

e) solubilizing the inclusion bodies in buffer containing solubilization 

agent; 

f) centrifuging the solubilized inclusion bodies to obtain a supernatant 
containing the polypeptide; 

25 g) loading the supernatant over a single denaturing column; 

h) collecting the proteinase; and 

i) renaturing the collected polypeptide. 
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Preferably, the cells are lysed by incubating for 30 min at 4°C in 50 mM Tris-Cl, pH 8.0, 
. 300 mM NaCl, 500 fig/ml lysozyme, 200 jig/ml DNase I, and protease inhibitors (such as 
PMSF and leupeptin); followed by three freeze-thaw cycles; addition of Triton X-100 to a 
final concentration of 1%; and vortexing. Preferably, following lysis the inclusion bodies 
5 are pelleted by centrifugation at 10,000 x g. Preferably, the inclusion bodies are then 

solubilized with 6 M GuHCl, 100 mM NaH 2 P0 4 and 10 mM Tris-Cl, pH 8.0. Preferably, 
before loading on the denaturing column the solubilized inclusion bodies are centrifuged 
for 20 minutes at 15,000 x g. Preferably, the denaturing column contains Ni-NTA (nicke- 
nitilotiacetic acid-agarose) Superflow resin (Qiagen). Preferably, the proteinase is eluted 
10 with 6 M urea, 100 mM NaH 2 P0 4 , and 10 mM Tris-Cl, pH 4.5, and the collected 

proteinase fraction is adjusted with 10 N NaOH to pH 8.5, and renatured during dialysis 
for 4-8 hours in storage buffer (100 mM Tris-Cl, pH 8.5, 500 mM NaCl, 10% Glycerol, 5 
mM DTT, 0.5 mM EDTA). Preferably, as a final step, the dialyzed proteinase fraction is 
centrifuged for 30 minutes at 15,000 x g at 4°C to collect the supernatant containing the 
1 5 active renatured proteinase. 

Other proteinase inhibitors, methods for lysing the cells, buffers, solubilizing 
agents, and methods of renaturing the purified protein are well known to the skilled 
artisan, and some are described in the previous sections of the specification. It is also 
pointed out that any host cell that expresses large quantities of the protein of interest in 
20 the form of inclusion bodies or any cell that endogenously expresses the protein in the 

form of inclusion bodies are useful for purifying large quantities of the protein of interest. 

In a preferred embodiment, the protein to be isolated in large quantities is 
expressed in E. coli as a fusion protein comprising in addition to the protein, a proteinase 
cleavage site, and a fusion partner. Proteinase cleavage sites, as well as fusion partner or 
25 carrier protein, are well known to the skilled artisan and have been described in detail in 
other sections of the present specification. A preferred proteinase cleavage site is the site 
recognized by the 27 kDa NIa proteinase and the preferred carrier protein is six His tags. 
The histidines enables binding to the Ni-NTA column. The 27 kDa NIa proteinase, unlike 
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other proteinases, exhibits high specificity, insensitivity to many proteinase inhibitors 
used in protein purification, and efficient cleavage under broad range of temperatures 
(Polayes et ah, 1994). Moreover, the protein to be purified is easily separated from the 
carrier peptide and the 27 kDa NIa proteinase. 

In light of the foregoing general discussion, the specific examples presented below 
are illustrative only and are not intended to limit the scope of the invention. Other generic 
and specific configurations will be apparent to those persons skilled in the art. 



EXAMPLES 
10 Example 1. Cloning 

A wild type 27 kDa NIa nucleic acid was generated by PCR using pTL-5495 
(ATCC 45036) as the template, 5TEVP1 : 5'-CAT CAG CGG GCC ATG GCT GAA 
AGC TTG TTT AAG-3' (SEQ ID NO: 2) as the 5' primer, and 3TEVH1: 5'-CTG ATG 
CAC GGA TCC TCA TTA ATG GTG ATG GTG ATG GTG CAA TTG CGA GTA 
15 GAC TAA TTC ACT CAT G-3' (SEQ ID NO: 3) as the 3' primer. This nucleic acid 

translates to a proteinase with a C-terminal His 6 tag. The mutant 27 kDa NIa nucleic acid 
was generated by sequential PCR reactions using pTL-5495 as the template for the first 
reaction with 5TEVP1 as the 5' primer and TEVPSN: 5'-GAG TTG AGT TGC TTC TTT 
GAC TGG CTG AAA GGG TTC TTC AGG TTT GTT CAT GAA AAC TTT GTG 
20 GCC-3' (SEQ ID NO: 4) as the 3' primer to introduce the S219N mutation at the internal 
self-cleavage site. The resultant PCR product was used as the template in a second PCR 
reaction using the 5' primer 5TEVP1 and the 3' primer 3TEVH1. The nucleic acids were 
ligated into the pET15b expression vector (Novagen) using the Ncol and BamHl sites and 
transformed into E. coli DH5a (GibcoBRL) competent cells. Plasmids, pTPWT (wild 
25 type 27 kDa NIa proteinase) and pTPSN (mutant form with Ser to Asn mutation), were 
sequenced for accuracy and transformed into BL21 (DE3) (Novagen) competent cells for 
expression. 
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Example 2. Proteinase Expression and Purification 

The wild-type and mutant form 27 kDa NIa proteinases were expressed in BL21 
(DE3) cells grown at 37 °C in Luria Broth with 100 |iM Ampicillin to optical density 600 
>0.7> Cultures were induced with 400 IPTG for >4 hours. Cell pellets were 
5 harvested by centrifugation, resuspended in 50 ml buffer containing 50 mM Tris-Cl, pH 
8.0 and 300 mM NaCl per liter of cell culture, and stored at -80°C. Lysis and purification 
of the soluble fraction containing the 27 kDa NIa proteinase were performed as described 
by Parks et aL (1995) through the Ni-NTA agarose purification step, except 10% glycerol, 
300 mM NaCl, and 5 mM J3ME were included in all buffers, and the Ni-NTA agarose 

10 column was washed with buffer containing 1 0 mM imidizole and eluted with buffer 
containing 400 mM imidizole. 

For inclusion body purification, cell suspensions were thawed in cool water and 
the buffer was adjusted to give a final concentration of 500 |ig/ml lysozyme (Sigma), 200 
|ig/ml DNASE I (Boehringer Mannheim), 50 (ig/ml PMSF, 10 fig/ml Leupeptin 

15 (Boehringer Mannheim), 20 mM MgS0 4 , and 2 mM CaCl 2 . The cells were lysed as 

follows: rocking for 30 minutes at 4°C, followed by 3 freeze-thaw cycles, lysed the cells; 
adding Triton x-100 to a final concentration of 1%; and vortexing. The inclusion bodies 
were pelleted by centrifugation at 10,000 x g. Purified inclusion bodies were solubilized 
in a buffer containing 6 M GuHCl, 100 mM NaH 2 P0 4 and 10 mM Tris-Cl, pH 8.0, and 

20 stored at -80°C. 

Denaturing column chromatography was done at 4°C using 10 ml Ni-NTA 
Superflow resin (Qiagen) per liter cell culture. The column was equilibrated with 10 
column volumes (cv) of equilibration buffer (6M Urea, 100 mM NaH 2 P0 4 , and 10 mM 
Tris-Cl, pH 8.0). The solubilized inclusion bodies were thawed in a 65 °C bath and 

25 centrifuged 20 minutes at 15,000 x g at 4°C. The supernatant was loaded onto the 

column by gravity flow. The column was washed with 4 cv equilibration buffer, then 6 
cv wash buffer (6 M Urea, 100 mM NaH 2 P0 4 , and 10 mM Tris-Cl, pH 6.3). The 
proteinase was eluted in elution buffer (6 M Urea, 100 mM NaH 2 PQ 4 , and 10 mM Tris- 
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CI, pH 4.5) with a 5 minute column incubation between each fraction until a total of 6 
fractions were collected. Fractions containing the 27 kDa NIa proteinase were pooled, 
adjusted to pH 8.5 with 10 N NaOH, dialyzed 4-8 hours in storage buffer (100 mM Tris- 
Cl, pH 8.5, 500 mM NaCl, 10% Glycerol, 5 mM DTT, and 0.5 mM EDTA), and 
5 centrifuged 30 minutes at 1 5,000 x g at 4°C The supernatant, containing active, 

renatured 27 kDa NIa proteinase was separated from the pellet, containing precipitated 27 
kDa Nla proteinase, and both were stored at -80°C. The pellet is successively 
resuspended in Equilibration Buffer and redialyzed to obtain more renatured, active 
proteinase as needed. 

1 0 The 27 kDa NIa proteinase preparations were quantitated using the Bradford 

Assay (Biorad) using BSA as a standard. The yield of active 27 kDa NIa proteinase from 
the soluble preparation was estimated to be less than 10% of the total protein as this 
sample was not assayed for activity. For the insoluble preparation, the active 27 kDa NIa 
yield is reported as milligrams of proteinase obtained from the first renaturation of the 

1 5 eluate from the Ni-NTA agarose column. 

Example 3. Activity Assays 

A 17 kDa substrate containing the target cleavage site Glu J Asn~Leu-Tyr-Phe-Gln- 

Gly (SEQ ID NO: 5) produces a -15 kDa peptide when cleaved with commercially 
20 available 27 kDa NIa proteinase (GibcoBRL). Substrate cleavage activity was assayed in 

30 |il reactions with 100, 250, or 500 [iM substrate and 0.75 /ug proteinase in assay buffer 

( 50 mM Tris-Cl, pH 8.0, 1 mM DTT, and 0.5 mM EDTA) incubated at 30°C for 1 hour. 

Samples were electrophoresed in 15% SDS-PAGE gels, stained with Coomassie Blue, 

and protein bands were quantitated using a gel documentation system (Kodak). The net 
25 intensities of full-length and cleaved substrate bands were used to calculate the specific 

activity for the substrate at each substrate concentration. 

Self-cleavage activity was assayed in 40 |j,l reactions containing 7.5 \xg 27 kDa 

NIa proteinase in 50 mM Tris-Cl, pH 8.0, ImM DTT, and 0.5 mM EDTA, incubated at 
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4°C, -20°C, and -80°C. At 0.5, 1, 2, and 5.5 weeks, 10 ^1 sample was removed and 
boiled with denaturing SDS-PAGE loading buffer, loaded onto 15% SDS-PAGE gels, and 
quantitated as per above. The net intensities of full-length and cleaved proteinase bands 
were used to calculate the percent proteinase cleaved at each time point. 

5 

Results 

pTPWT and pTPSN encode peptides with respective molecular weights of 28,563 
Daltons (about 28.5 kDa) and 28,590 Daltons (about 28.5 kDa). During cloning, the 
proteinase N-terminus was altered from wild type Gly-Glu-Ser- to Met-Ala-Glu-Ser- 
10 (SEQ ID NO: 6) and a -Leu-His-His-His-His-ffis-His (SEQ ID NO: 7) tag was added to 
the C-terminus. 

Soluble preparations of 27 kDa NIa proteinase yielded minimal quantities of 
proteinase (Table 1). 27 kDa NIa proteinase detected as a band in cell lysate is missing 
in clarified lysate but present in the inclusion body preparation (Figure 1, lanes 1-3 and 7- 

15 9). Ni-NTA agarose purification under denaturing conditions yields purer proteinase than 
purification under soluble conditions (Figure 1, lanes 4, 5, 10, and 1 1). In samples 
containing 4.5 |xg total protein, a single contaminant is detected in renatured 27 kDa NIa 
proteinase after the denaturing purification indicating >95% purity (Figure 1, lanes 13- 
15). Final yields of >10 mg renatured, active 27 kDa NIa proteinase were regularly 

20 obtained after a single rehaturation step for both TPWT and TPSN (Table 1). Final 

concentrations of several renatured preps indicate a maximum solubility of ~1 .5 mg/ml 
(data not shown). 
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Table 1. Purification Yields for Soluble vs Insoluble Procedures (per liter cell culture). 





Total protein Ni-NTA 
load (mg) 


Total Ni-NTA-pure 
protein (mg) 


active rTEVP 
(mg)* 


% yield ** 


TPWT 


Soluble prep 


977 


4.08 


<0.41 


0.04 




Insoluble prep 


80 


30.29 


>10.96 


1.04 




TPSN 


Soluble prep 


946 


3.67 


<0.37 


0.04 




Insoluble prep 


81 


32.35 


>12.84 


1.25 




* Active rTEVP for the soluble preps are estimated to be <10% of total purified protein. Data reported for 
insoluble preps is reported as greater than the quantity of protein obtained from a single round of renaturation. 




** % yield active rTEVP from total cellular protein 



Note: Yields of total protein, active 27 kDa NIa proteinase, and overall percent yield are shown for both the 
soluble and insoluble purification protocols. 
5 Protein quantitation at each purification step was estimated using the Bradford assay with a BSA standard 
curve. 



The specific activity for substrate cleavage is defined as the picomole substrate 
cleaved per picomole 27 kDa NIa proteinase in 1 hour at 30° C. The data indicate that 

10 TPWT and TPSN have similar activity to each other and to the commercially available 
proteinase at all substrate concentrations tested (Figure 2). 

Both the commercially available 27 kDa NIa proteinase and purified TPWT 
contain the truncated self-cleavage product in the final preparation, while the TPSN 
preparation contains only full-length proteinase (Figure 1, lanes 13-15). The self- 

15 cleavage activity of TPWT is similar to the commercially available proteinase at 4°C and 
-20 °C. TPSN has reduced self-cleavage activity at 4°C, and no self-cleavage activity at 
-20°C (Figures 3, 4 a, and 4b). None of the proteinase samples tested exhibited significant 
self-cleavage activity at -80 °C during our 5.5 week assay (data not shown). 
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Conclusions 

The present invention shows that wild type, as well as mutant, histidine-tagged 27 
kDa NIa constructs can be induced to express >95% of the proteinase in the insoluble 
5 fraction (Figure 1). The present invention provides a novel method of denaturing 

purification of the insoluble fraction followed by renaturation of the peptide that yields up 
to 10 times as much active 27 kDa NIa proteinase as the soluble preparation reported by 
Parks et al (1995). This allows for stock preparations of >95% pure, active 27 kDa NIa 
proteinase to be made for general use. 

10 Parks et al (1 995) has shown that the truncated form of 27 kDa NIa proteinase has 

significantly less substrate cleavage activity than the full-length form. The proteinase 
continues to cleave itself over time when stored at 4°C and -20°C, potentially reducing 
the quantity of fully active enzyme in an 27 kDa NIa proteinase stock over time. Self- 
cleavage appears to be arrested at -80 °C. 

1 5 The Ser2 1 9- Asn mutation in the mutant form of the 27 kDa NIa proteinase, 

provided by the present invention, significantly inhibits self-cleavage activity, allowing 
for increased yields of full-length, fully active 27 kDa NIa proteinase, from either soluble 
or insoluble preps. The present invention also permits long-term storage of 27 kDa NIa 
stocks at -20°C, and short-term storage at 4°C. 

20 The 27 kDa proteinase and its mutant form are valuable tools for protein 

purification protocols because of its target site specificity and its activity under a wide 
variety of conditions. The present invention by utilizing a denaturing preparation of the 
TPSN mutant, enables production of large stocks of rTEVP with consistent activity 
characteristics. 

25 

It should be understood that the foregoing discussion and examples merely present 
a detailed description of certain preferred embodiments. It therefore should be apparent 
to those of ordinary skill in the art that various modifications and equivalents can be made 
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without departing from the spirit and scope of the invention. All journal articles, other 
references, patents, and patent applications that are identified in this patent application are 
incorporated by reference in their entirety. 
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CLAIMS 

1 . An isolated mutant proteinase having a molecular weight of about 27 kDa 
and reduced self-cleavage activity relative to the self-cleavage activity of its wild-type 
proteinase. 

2. The mutant proteinase of claim 1, wherein the mutant proteinase 
comprises an amino acid sequence in which the residue corresponding to Ser 219 of the 
27 kDa NIa proteinase is replaced with another residue. 

3. The mutant proteinase of claim 2, wherein the residue replacing Ser 21 9 of 
27 kDa NIa proteinase is Asn. 

4. A composition comprising the mutant proteinase of claim 1 and a carrier. 
15 

5. A fusion protein comprising the mutant proteinase of claim 1 fused to a 
heterologous polypeptide. 

6. The fusion protein of claim 5, wherein the heterologous polypeptide 
20 consists of six histidines. 

7. An isolated nucleic acid molecule comprising a sequence encoding a 
mutant proteinase having a molecular weight of about 27 kDa and reduced self-cleavage 
activity relative to the self-cleavage activity of its wild-type proteinase. 

25 

8. An isolated nucleic acid molecule of claim 7, wherein the nucleic acid 
molecule encodes an amino acid sequence in which the residue corresponding to Ser 219 
of the 27 kDa NIa proteinase is replaced with another amino acid. 



5 



10 
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9. An isolated nucleic acid of claim 8, wherein the nucleic acid comprises a 
sequence that encodes an amino acid sequence in which the residue corresponding to Ser 
2 1 9 of the 27 kDa NIa proteinase is Asn. 



10. A vector comprising the nucleic acid of claim 7. 

1 1 . An expression vector comprising the nucleic acid of claim 7. 
10 12. A host cell comprising the expression vector of claim 1 1 . 

13. A method of producing a proteinase having a molecular weight of about 27 
kDa and reduced self-cleavage activity relative to the self-cleavage activity of its wild- 
type proteinase activity comprising cultivating the host cell of claim 12 under conditions 

1 5 that allow expression of the proteinase. 

14. A nucleic acid encoding a fusion protein comprising a proteinase, having a 
molecular weight of about 27 kDA and reduced self-cleavage activity relative to the self- 
cleavage activity of its wild-type proteinase, fused to a heterologous protein. 

20 

1 5. The nucleic acid of claim 14, wherein the heterologous protein consists of 
six histidines. 

16. A method of purifying a polypeptide that forms inclusion bodies in a cell 
25 comprising the following steps: 

a) obtaining cells expressing the polypeptide; 

b) lysing the cells; 

c) pelleting inclusion bodies by centrifugation of the lysed cells; 
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agent; 



d) centrifuging the inclusion bodies; 

e) solubilizing the inclusion bodies in buffer containing solubilization 

f) centrifuging the solubilized inclusion bodies to obtain a supernatant 
containing the polypeptide; 

g) loading the supernatant over a single denaturing column; 

h) collecting the proteinase; and 

i) renaturing the collected polypeptide. 



10 17. The method of claim 16, wherein the polypeptide has a molecular weight 

of about 27 kDa and reduced self-cleavage activity relative to its wild-type polypeptide. 

1 8 . The method of claim 1 6, wherein the polypeptide is selected from the 
group consisting of tobacco etch virus (TEV) 27 kDa Mia proteinase and mutant TPSN 27 

1 5 kDa NIa proteinase. 

19. The method of claim 16, wherein in step (g) the denaturing column 
contains Ni-NTA. 

20 20. A method of using the proteinase of claim 1, comprising incubating the 

proteinase with a protein for a sufficient amount of time to allow cleavage of the protein. 
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